An Online Convex Optimization Approach to Blackwell's Approachability

نویسنده

Nahum Shimkin

چکیده

The notion of approachability in repeated games with vector payoffs was introduced by Blackwell in the 1950s, along with geometric conditions for approachability and corresponding strategies that rely on computing steering directions as projections from the current average payoff vector to the (convex) target set. Recently, Abernethy, Batlett and Hazan (2011) proposed a class of approachability algorithms that rely on the no-regret properties of Online Linear Programming for computing a suitable sequence of steering directions. This is first carried out for target sets that are convex cones, and then generalized to any convex set by embedding it in a higher-dimensional convex cone. In this paper we present a more direct formulation that relies on the support function of the set, along with suitable Online Convex Optimization algorithms, which leads to a general class of approachability algorithms. We further show that Blackwell’s original algorithm and its convergence follow as a special case.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lecture : Blackwell's Approachability Theorem . Blackwell's Approachability Theorem

Counter-example to (.): If S = {(p,q) : p = q} and r(p,q) = (p,q), one can trivially, for all q, choose p(q) = q to guarantee that r(p(q),q) ∈ S. It is however not possible to find a p which works for all q, indeed the only p which works for a given q is p = q. However, the duality statement (.) holds when S is a half-space {x | v ·x ≥ c}. To see this, define a zero-sum game with scalar p...

متن کامل

Zero - Sum Games with Vector - Valued Payoffs

In this lecture we formulate and prove the celebrated approachability theorem of Blackwell, which extends von Neumann's minimax theorem to zero-sum games with vector-valued payoffs [1]. (The proof here is based on the presentation in [2]; a similar presentation was given by Foster and Vohra [3].) This theorem is powerful in its own right, but also has significant implications for regret minimiz...

متن کامل

Online Learning and Blackwell Approachability with Partial Monitoring: Optimal Convergence Rates

Blackwell approachability is an online learning setup generalizing the classical problem of regret minimization by allowing for instance multi-criteria optimization, global (online) optimization of a convex loss, or online linear optimization under some cumulative constraint. We consider partial monitoring where the decision maker does not necessarily observe the outcomes of his decision (unlik...

متن کامل

A Learning Scheme for Blackwell’s Approachability in MDPs and Stackelberg Stochastic Games

The notion of approachability was introduced by Blackwell ([8]) in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector-cost of a given player towards a given target set, irrespective of the strategies of the other players. In this paper, motivated from the multi-objective...

متن کامل

A Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games

The notion of approachability was introduced by Blackwell [1] in the context of vector-valued repeated games. The famous ‘Blackwell’s approachability theorem’ prescribes a strategy for approachability, i.e., for ‘steering’ the average vector cost of a given agent towards a given target set, irrespective of the strategies of the other agents. In this paper, motivated by the multi-objective optim...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Journal of Machine Learning Research

دوره 17 شماره

صفحات -

تاریخ انتشار 2016

An Online Convex Optimization Approach to Blackwell's Approachability

نویسنده

چکیده

منابع مشابه

Lecture : Blackwell's Approachability Theorem . Blackwell's Approachability Theorem

Zero - Sum Games with Vector - Valued Payoffs

Online Learning and Blackwell Approachability with Partial Monitoring: Optimal Convergence Rates

A Learning Scheme for Blackwell’s Approachability in MDPs and Stackelberg Stochastic Games

A Learning Scheme for Approachability in MDPs and Stackelberg Stochastic Games

عنوان ژورنال:

اشتراک گذاری